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VIRAL PEPTIDES WITH STRUCTURAL HOMOLOGY TO PROTEIN G OF RESPIRATORY 
SYNCYTIAL VIRUS 

This invention relates to viruses of the family 
Paramyxoviridae, particularly viruses of the respiratory 
5 syncytial virus group. More particularly, the invention 
relates to the attachment protein of these viruses, and to 
the structure of the region of the attachment protein which 
is involved with binding to the cellular receptor for the 
virus . 

10 

BACKGROUND OF THE INVENTION 

Respiratory syncytial viruses are significant 
pathogens of human and animals throughout the world. 
Virtually all humans become infected with human respiratory 

15 syncytial virus (RSV) by two years of age, and repeated 

infections occur throughout life. RSV is regarded as the 
most serious respiratory pathogen of infants and young 
children, but it can also cause serious disease in 
immunocompromised adults and in the elderly. Serious cases 

20 of infection manifest in bronchiolitis and pneumonia, and 
can be fatal. Estimates of the impact of RSV infection 
indicate that it results in 91,000 hospital admissions 
annually in the United States of America (Heilman, 1990) 
and hospitalisation of 1% of children before the age of 12 

25 months in Britain (Cane and Pringle, 1995) . Epidemics of 
the virus occur on an annual basis coincidental with other 
viruses, such as influenza and parainfluenza. 

Natural immunity does not appear to provide 
protection against >.RSV infection (Mcintosh and Chanock, 

30 1990; Hall, 1994). Even infants provided with maternal 
antibodies are susceptible to RSV infection (Mcintosh, 
1990) . Vaccine development strategies to combat RSV have 
not been successful, and in one study a formalin- 
inactivated virus vaccine actually exacerbated disease 

35 (Mcintosh and Chanock, 1990; Hall, 1994) . The only 

pharmaceutical agent presently available to treat RSV, 
Ribavirin, is expensive and complex to deliver as an 
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aerosol, and is of questionable efficacy (Mcintosh, 1990; 
Levin, 1994). Thus it is apparent that a greater 
understanding of the infectious mechanism and immunobiology 
of RSV is required to develop control measures based on 
5 vaccines or antiviral agents. 

RSV belongs to the Pneumovirus genus of the 
Paramyxoviridae family of single strand negative sense RNA 
viruses, which includes other serious pathogens such as 
parainfluenza, mumps and measles (Mcintosh, 1990; 

10 Kingsbury, 1990) and the recently identified zoonotic, 
equine morbillivirus (Murray et al, 1995) . Like other 
Paramyxoviridase, RSV has two membrane glycoproteins which 
mediate invasion of susceptible cells (Morrison and 
Portner, 1991) . One protein, the large glycoprotein or 

15 G protein, functions in attachment to cells. The other, 
the so-called fusion or F protein, causes fusion between 
the lipid of the viral membrane envelope and the cell 
plasma membrane lipid bilayer. RSV infection can also be 
transmitted by fusion of membranes of infected cells, which 

20 have F protein expressed on their surface, with adjacent 
cells . 

The molecular architecture of the F protein is 
conserved between all members of the three genera of the 
Paramyxoviridae; however, each genus has a characteristic 

25 attachment protein (Morrison and Portner, 1991) . Members 
of the Paramyxovirus genus have attachment proteins with 
neuraminidase and haemagglutinating activities; the 
attachment proteins of the Morbillivirus genus are 
haemagglutinins, but lack neuraminidase activity; and the 

30 attachment proteins of Pneumoviruses lack both 

haemagglutination and neuraminidase properties. Attachment 
proteins of the Paramyxovirus (Morrison and Portner, 1991) 
generus participate in sialic acid receptor-type 
interactions, which account for their ability to 

35 agglutinate red blood cells. RSV is also reported to 

interact with sialic acid; however, the mechanism of RSV 
G protein attachment and the identity of the cellular 



WO 97/46581 



PCT/AU97/00351 



receptor for the G protein are not yet known (Markwell, 
1991) . 

The Paramyxoviridae F protein is invariably a 
type I integral membrane protein, but the attachment 
5 proteins are all type II integral membrane proteins. The 
oligosaccharide compositions of the Paramyxovirus and 
Morbillivirus attachment proteins are typical of integral 
membrane proteins (Morrison and Portner, 1991), but the RSV 
attachment protein, also termed the G protein, appears to 

10 contain an unusually high proportion of carbohydrate 

(Morrison and Portner, 1991; reviewed by Sullender and 
Wertz, 1991) . The gene for the RSV strain A2 attachment 
protein encodes a potential primary translation product of 
298 amino acids with a theoretical Mr of 32588 (Satake et 

15 al, 1985; Wertz et al, 1985), but has an apparent molecular 
weight of 80,000-90,000 (Levine, 1977; Gruber and Levine, 
1983; Lambert and Pons, 1983) as estimated by 
electrophoresis in polyacrylamide gels containing sodium 
dodecylsulf ate (SDS-PAGE) . The unusually high molecular 

20 weight of the RSV attachment protein as measured by 

SDS-PAGE has been attributed to a high content of both O- 
and N-linked oligosaccharides (Gruber and Levine, 1985; 
Lambert, 198 8) . 

It has been suggested that a conserved region in 

25 the central part of the G protein ectodomain may be 

involved in ligand interactions with a cellular receptor 
for the G protein (Collins, 1991; Johnson et al, 1987); 
however, this remains to be demonstrated- Peptide epitope 
scanning experiments, using nested sets of synthetic 

30 peptides, have identified the cysteine-containing region as 
a subtype-specific antigenic determinant (Norrby et al, 
1987) which shows some dependence on oxidation of the 
cysteines to cystine (Akerlind-Stopner et al, 1990) . 
Monoclonal antibodies which react with this region also 

35 block binding of the G protein to cells (Feldman and 
Hendry, 1996) . However, at the priority date of this 
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application the disulphide linkage status of this region 
remained to be elucidated. In addition, the glycosylation 
status of the serine and threonine residues in the 
conserved domain and around the conserved cysteine residues 
5 or the Asn-Pro-Thr sequence between cysteines 176 and 182 
of the ectodomain had not been determined, and the sites of 
disulphide bonds were not known. 

PCT/FR95/01464 by Pierre Fabre Medicament relates 
to a method for recombinant production of an analogue of 

10 RSV protein G, in which there is at least one modification 
of the amino acid sequence in a hydrophobic region of the 
peptide, preferably in a non-t ransmembrane hydrophobic 
region which is not essential for the biological activity 
of the peptide. The modifications are directed to changing 

15 the hydrophobicity of the recombinant production, either by 
deleting a hydrophobic amino acid of the natural sequence, 
or by replacing it with a non-hydrophobic amino acid. This 
results in the recombinant product being exposed on the 
membrane of the host cell by a covalent bond membrane- 

20 anchoring moiety, or in being secreted into the culture 
medium. One of the modifications disclosed replaces 
cysteine at position 173 and/or position 186 by an amino 
acid which cannot form a disulphide bond; this favours 
formation of a disulphide bond between cysteines 176 and 

25 182, which are stated to be critical for the immunogenecity 
of the sequence, and avoids formation of disordered 
disulphide bonds. Thus while this specification does 
suggest that the disulphide bond between cysteines 176 and 
182 is critical, it teaches away from any suggestion that a 

30 disulphide bond between cysteines 173 and 186 could be of 
importance. Furthermore there is no disclosure or 
suggestion of the presence of any glycosylation in this 
region. After the priority date of this application, a 
paper describing the 3-dimensional solution structure, as 
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determined by NMR spectroscopy, of a synthetic 32-residue 
peptide corresponding to residues 58 to 189 of the 
immunodominant central conserved region of protein G of 
bovine RSV has appeared (Doreleijers et al, 1996). This 
5 paper was submitted to the journal on 26 August 1996, and 
consequently is not prior art in respect of this 
application. The peptide was found to form a rigid core 
region comprising two short a-helices connected by type 1 1 
p-turn, and having two disulphide bridges. These were 

10 unambiguously assigned as being between cysteine 173 and 
cysteine 186 (outer bridge) and cysteine 176 and cysteine 
182, on the basis of NMR evidence. It is stated that these 
disulphide bond assignments were subsequently confirmed by 
analysis of proteolytic digestion products and by affinity 

15 measurements (Langedijk et al, 1996; published after the 
priority date) . 

We have now found that the four cysteine residues 
of the ectodomain form a uniform disulphide bond pattern, 
and that the region of the protein around these residues 

20 and the conserved region lacks oligosaccharide attachment. 
We have shown that peptides with these 
characteristics can bind to cells of a type equivalent to 
the in vivo target of RSV, and that the peptides display 
antiviral activity. This is to our knowledge the first 

25 time that a fragment of G protein has been shown to have 
any abnormal activity. This finding has significance for 
the design and production of vaccines, diagnostic reagents, 
and therapeutic compounds for human RSV and related 
viruses . 

30 In particular, the absence of glycosylation 

leaves this region of the ectodomain open for receptor 
binding. Heretofore, the possibility of attached 
oligosaccharide being present in this region has meant that 
the design and interpretation of experiments to examine 

35 binding of the virus to its cellular receptor has been 
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exceedingly difficult, and accurate interpretation almost 
impossible, because of the level of variability made 
possible by glycosylation . 

5 SUMMARY OF THE INVENTION 

According to one aspect, the invention provides a 
compound having structural homology to a contiguous 
sequence of amino acids within the sequence representing 
residues 149-197 of the G protein of respiratory syncytial 
10 virus, in which 

a) no oligosaccharide is linked to potential 
serine, threonine or asparagine attachment sites; 

b) four cysteine residues are involved in 
disulphide linkages; and 

15 c) the pattern of disulphide linkage is Cys 173 

linked to Cys 186, and Cys 176 linked to Cys 182, 

and in which said compound possesses a biological 
activity of respiratory syncytial virus G protein. 

For the purposes of this specification, a 
20 biological activity of respiratory syncytial virus 
G protein is defined as one or more of 

a) the ability to bind to one or more 
antibodies selected from the group consisting of rabbit 
polyclonal antibody directed against RSV, murine monoclonal 

25 antibody directed against RSV, and antibodies present in 
human convalescent sera from patients infected with RSV; 
and 

b) the ability to bind to cells capable of 
being infected with RSV. 

30 Preferably the virus is selected from the group 

consisting of human RSV subtype A, human RSV subtype B, 
bovine RSV, and mutants and variants thereof. 

More preferably the compound is a peptide 
corresponding to amino acids 158 to 196 of the 

35 RSV G protein. Even more preferably the peptide 

corresponds to amino acids 165 to 187 of the RSV G protein. 
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Most preferably the compound is a peptide having 
one of the following amino acid sequences: 



SEQ 


ID 


NO 


1 


KQRQNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICKRI PNKKPGKK 


SEQ 


ID 


NO 


2 




SEQ 


ID 


NO 


3 




SEQ 
SEQ 


ID 
ID 


NO 
NO 


4 




5 




SEQ 
SEQ 


ID 
ID 


NO 
NO 


6 




7 




SEQ 


ID 


NO 


8 




SEQ 


ID 


NO 


9 


-S-SKN--K — KD-Y G--QL-KS T — SN--K-- 


SEQ 


ID 


NO 


10 


-S-SKN — K — KD-Y G — QL-KS T — SN — K-- 


SEQ 


ID 


NO 


11 


-P-PKN — K — KD-Y G — QL-KS T — SN — K-- 


SEQ 


ID 


NO 


12 


-P-LKN — K — KD-Y G — QL-KS T — SN — K-- 


SEQ 


ID 


NO 


13 


-P-LKN — K — KD-Y G — QL-KS T-SSN — K-- 


SEQ 


ID 


NO 


14 


-P-LKN — K — KD-Y G — QL-KS T — SN — K — 


SEQ 


ID 


NO 


15 


-S-SKN — K — KD-Y G — QL-KS T — SN — K — 


SEQ 


ID 


NO 


16 


NPSGSI — ENHQDHNN-QTLPY T-EG-LA-LSL-HIETERA-SRA 


SEQ 


ID 


NO 


17 




SEQ 


ID 


NO 


18 





We have additionally surprisingly found that when 
the cysteine residues 173 and 176 in the sequence 

25 representing residues 149-197 of the G protein of RSV are 

blocked, and cysteines 182 and 186 are deleted, the peptide 
still strongly enhances binding of f luorescien-labelled 
peptides of sequences corresponding to residues 14 9-197 and 
163-197 of the G protein sequence, and exerts a strong 

30 anti-viral effect- This suggests that there may be a 

second site for binding of G protein to cellular receptors 
for RSV. 

Consequently, in another aspect the invention 
provides a compound having structural homology to a 
35 contiguous sequence of amino acids within the sequence 

representing residues 149-197 of the G protein of RSV, in 
which at least one of cysteines 173, 176, 182 and 186 is 
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absent or blocked, and in which said compound is not 
glycosylated, and has the ability to inhibit infectivity of 
RSV. 

Any suitable assay for measuring inhibition of 
5 infectivity may be used, for example inhibition of 

cytopathic effect, or inhibition of viral proliferation. 

Sequences encompassing conservative substitutions 
of amino acids are within the scope of the invention, 
10 provided that the biological activity is retained. 

It is to be clearly understood that the compounds 
of the invention include peptide analogues, including but 
not limited to the following: 

1. Compounds in which one or more amino acids 
15 is replaced by its corresponding D-amino acid. The skilled 

person will be aware that retro-inverso amino acid 
sequences can be synthesised by standard methods; see for 
example Choreo and Goodman, 1993; 

2. Peptidomimetic compounds, in which the 

20 peptide bond is replaced by a structure more resistant to 
metabolic degradation. See for example Olson et al, 1993; 
and 

3. Compounds in which individual amino acids 
are replaced by analogous structures for example, 

25 ge/n-diaminoalkyl groups or alkylmalonyl groups, with or 

without modified termini or alkyl, acyl or amine 

substitutions to modify their charge. 

The use of such alternative structures can 

provide significantly longer half-life in the body, since 
30 they are more resistant to breakdown under physiological 

conditions . 

Methods for combinatorial synthesis of peptide 
analogues and for screening of peptides and peptide 
analogues are well known in the art (see for example Gallop 
35 et al, 1994; Hogan, 1997). It is particularly contemplated 
that the compounds of the invention are useful as templates 
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for design and synthesis of compounds of improved activity, 
stability and bioavailability. 

Because of the biological activity of the 
compounds of the invention, they are useful as therapeutic 
and diagnostic agents, and are also useful in screening in 
order to identify compounds capable of inhibiting binding 
of the virus to its host cell. Therefore the invention 
also provides 

a) a diagnostic composition comprising a 
compound of the invention; 

b) a pharmaceutical composition comprising a 
compound of the invention together with a pharmaceutical! y 
acceptable carrier, and optionally together with one or 
more other antiviral agents active against RSV; 

c) a method of prevention or treatment of 
Pneumovirus infection comprising the step of administering 
an effective amount of a compound of the invention to a 
mammal in need of such treatment; and 

d) a method of diagnosis of Pneumovirus 
infection, comprising exposing a biological fluid or sample 
from a mammal suspected of being infected with said virus 
to a compound of the invention, and measuring the 
interaction between the compound and said fluid or sample. 

Diagnostic kits are also within the scope of the 

invention. 

Because at least some compounds of the invention 
are immunogenic, in a further aspect the invention provides 
a method of immunisation against Pneumovirus infection, 
comprising the step of immunising a mammal at risk of such 
infection with an immunising-ef f ective dose of a compound 
of the invention, said compound being immunogenic and 
having the ability to elicit protective antibody. 

It is also contemplated that compounds of the 
invention may be used in conjunction with prior art 
vaccines. Because of the antiviral effect of compounds of 
the invention, this enables the dose of vaccine to be 
reduced, and the risk of side-effects is also reduced. 
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Similarly, even where the antibodies produced in 
response to immunisation with a compound of the invention 
are not protective, such antibodies will be useful as 
diagnostic reagents. For this application of the invention 
5 the only requirement is that the antibodies elicited have 
the capacity to interact with a Pneumovirus in a detectable 
manner. For example, the antibody can be coupled to a 
detectable marker such as a radioactive label, a 
fluorescent marker, a luminescent marker or an enzyme 
10 marker. The person skilled in the art will be aware of a 
great variety of suitable such markers. Thus both non- 
protective and protective antibodies directed against 
compounds of the invention are also within the scope of the 
invention . 

15 Compounds according to the invention may be 

directly labelled with a detectable marker; such as those 
mentioned above for labelling of antibodies, and/or with a 
photoaf finity label, and are useful for identification and 
structural characterization of the cellular receptor for 

20 RSV and other Pneumonoviruses . Knowledge of the structure 
of the receptor and the mechanism of its interaction with 
the G protein is useful in the design of antiviral 
compounds . 

The person skilled in the art will be aware that 
25 anti-idiotype antibodies directed against antibodies 
according to the invention provide useful structural 
information concerning the identity and mechanism of action 
of the receptor site for the G protein. 

While the invention is described in detail with 
reference to human respiratory syncytial virus, it will be 
clearly understood that the invention is applicable to the 
genus Pneumovirus in general, and particularly to bovine 
and ovine RSVs in addition to human RSV . It will be 
evident from the following description that while the 
35 sequence of the G protein of bovine RSV varies to some 
extent, there is significant conservation of a specific 
sequence, and that the cysteine residues are in the same 



30 



WO 97/46581 

- 11 

position as in human RSV G protein. The conservation of 
the disulphide bond bonding pattern in all strains tested 
indicates that this pattern has not varied during evolution 
of the virus, and that it is functionally significant. 

5 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A is a diagrammatic representation of the 
RSV G protein predicted by gene sequence analysis. The 
clear area containing a single cysteine residue (I) is the 
10 cytoplasmic domain, the black region (II) is the 

transmembrane domain and subdomains shaded gray (III & V) 
are the putative heavily glycosylated regions of the 
ectodomain separated by the non-glycosylated disulphide 
subdomain (IV); 
15 Figure IB shows the disulphide arrangement 

determined in this study to involve pairing of cysteine 173 
with cysteine 186, and cysteine 176 with cysteine 182; 

Figure 2 shows the amino acid sequence 
encompassing residues 149-197 of the G proteins of variants 
20 of different subtypes of RSV. Sequences 1-15 are human RSV 
strains, sequence 1 is that of the A2 strain of the 
A subtype (Satake et al, 1985; Wertz et al, 1985), 
sequence 2 is the Long A strain of the A subtype 
(Johnson et al, 1987), and sequences 3-8 are natural 
25 variants of the A subtype isolated in the same locality in 
a single year (Cane et al, 1991) . Sequences 9-15 are 
natural variants of the B subtype isolated in different 
localities over a 29-year period (Johnson et al, 1987; 
Cane et al, 1991; Sullender et al, 1990; Sullender et al, 
30 1991) . Sequence 16 is that of Bovine RSV (Lerch et al, 
1990) . Sequences 17 and 18 are variants of human RSV, 
R10c/1 and R10c/10, which were generated by propagation of 
the Long A strain in the presence of a monoclonal antibody 
directed to the cysteine-containing constant region of the 
35 ectodomain of the G protein (Rueda et al, 1994); 

Figure 3 illustrates the HPLC separation of 
protease fragments of RSV strain A2 G protein produced by 
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tryptic digestion of the entire protein (A) and by 
digestion of the fraction eluting at approximately 
73 minutes during HPLC of the tryptic digest with different 
proteases (B-D) . Chromatograms B, C and D are of digests 
5 obtained using pepsin, thermolysin, and post-proline 
cleavage enzyme, respectively; 

Figure 4 shows MALDI -TOF-MS spectra of components 
of the fraction eluting at approximately 73 minutes during 
HPLC of the tryptic digest of Figure 3A. A spectrum of an 

10 aliquot of the tryptic fraction is shown in A, and a 

spectrum of an aliquot of the unf ractionated peptic digest 
of this tryptic fraction is shown in B. The unf ractionated 
peptic digestion was performed as for Figure 3, except that 
the temperature was 22°C. C and D represent spectra of 

15 fractions eluting at approximately 61 minutes and 

64 minutes during HPLC of the peptic digest (Figure 3B) . 
Spectrum B was recorded in the linear mode. Spectra A and 
B were recorded with matrix 3 and spectra C and D were 
recorded with matrix 4; 

20 Figure 5 shows the proposed identities of peptide 

fragments detected by MALDI -TOF-MS in various digests and 
HPLC fractions. Theoretical m/z values corresponding to 
the proposed fragment identities are presented next to the 
corresponding sequence. All m/z values are for the 

25 oxidized sequences, except for fragments 1R, 2R and 3R, 
which are for reduced forms of these sequences; 

Figure 6 illustrates MALDI -TOF-MS spectra 
obtained by reduction of fragments 3(A) and 2(B) of the 
peptic digest of tryptic fragment 1. Matrix 4 was used to 

30 record both spectra; 

Figure 7 shows MALDI -TOF-MS spectra of an 
unf ractionated thermolytic digest of tryptic fragment 1(A) 
and the thermolytic fragment eluting at approximately 
54 minutes during HPLC of the thermolytic digest of 

35 Figure 3C (B) . Conditions for thermolytic digestion were 
as described for Figure 3 except that 0.5 mM CaCl 2 was 
used. Matrix 4 was used to record both spectra; 
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Figure 8 shows MALDI-TOF-MS spectra of an 
unf ractionated post-proline cleavage enzyme digest of 
tryptic digest 1 (A) , the fraction eluting at 
approximately 63 minutes during HPLC (Figure 3D) of this 
5 digest (B) and an unf ractionated post-proline cleavage 

enzyme digest of peptic fragment 2 (C) . The unf ractionated 
digests were prepared as for Figure 3, except that final 
enzyme concentrations of 0,1 mg/ml and 10 jag/ml were used 
to obtain spectra A and C, respectively. Matrix 4 was used 

10 to record all spectra; 

Figure 9 shows post-source decay fragment ion 
spectra of peptic fragment 2 (A) , peptic fragment 3 (B) and 
post-proline cleavage enzyme fragment 5 (C) . Fragment ions 
1-7, inclusive, correspond to an N-terminal fragment ion 

15 series of the b-type resulting in fragmentation of the 

amino acid residues with or without the peptide, A I C K. 
Fragment ions 8-14, inclusive, correspond to a C-terminal 
fragment ion series of the y-type. This fragmentation 
pattern is presented in a diagrammatic form in Figure 10 

20 together with m/z values for the numbered fragment ions of 
peptic fragment 2. An expanded portion of spectrum A 
(m/z = approximately 850 to 1650) is presented above the 
full spectrum. Matrix 3 was used to record spectra A and B 
and matrix 1 was used to record spectrum C; and 

25 Figure 10 illustrates the proposed fragmentation 

pattern of peptic fragment 2 based on data from Figure 9A. 

Figure 11 shows the sequences of residues 14 9-197 
from human, bovine, and ovine RSV G protein, indicating the 
features which are common to all strains. 

30 Figure 12 shows the sequences of the peptide 

derivatives described herein. 

The cysteinyl residues of all but peptide 4 were 
oxidised to cystine residues with the same linkage 
arrangement as in the native G protein. 

35 The cysteinyl residues of peptide 4 were retained 

in a form protected with the acetamidomethyl functionality. 
Ac denotes an acetylated amino terminus. 
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Amide denotes a carboxyl terminal amide, 
fitc denotes presence of fluorescein 
■ isothiocarbamyl 0-alanine at the amino terminus. 

bb denotes derivatisation of the amino terminus 
5 as a benzoyl benzylamide. 

biot denotes derivatisation of the amino terminus 
as a biotinyl amide. 

Figure 13 shows flow cytometry analysis of 
binding of f luoresceinyl-149-197 to HEp-2 cells. Relative 
10 fluorescence of cells is plotted on the abscissa versus 
relative cell number on the ordinate after incubation of 
cells (1) without or with (2) 468nM, (3) 1.17^iM (4) 2.34nM, 
(5) 4.68jiM, (6) 11.72jiM, (7) 23.44|aM and (8) 46.88|^M 
fluorescent peptide . 
15 Figure 14 shows flow cytometry analysis of 

binding of f luoresceinyl-163-197 to HEp-2 cells and the 
effect of non-fluorescent peptide derivatives on cell bound 
fluorescence. Relative fluorescence of cells is plotted on 
the abscissa versus relative cell number on the ordinate 
20 after incubation of cells with fluorescent peptide at 890nM 
(1) in the absence of non-fluorescent peptide derivatives 
and in the presence of (2) 4.89jiM Acl49-197, (3) 19.6jaM 
Acl63-197 and (4) 22.46^M Acl49-177. 

Figure 15 shows flow cytometry analysis of 
25 binding of f luoresceinyl-149-197 to HEp-2 cells and the 
effect of Acl49-197 on cell bound fluorescence. Relative 
fluorescence of cells is plotted on the abscissa versus 
relative cell number on the ordinate after incubation of 
cells (1) without fluorescent peptide and with fluorescent 
30 peptide at 4 68nM (2) in the absence of non-fluorescent 
peptide derivatives and (3) in the presence of 4.89^iM 
Acl49-197 . 

Figure 16 shows flow cytometry analysis of 
binding of f luoresceinyl-149-197 to HEp-2 cells and the 
35 effect of Acl49-197 oxidised A and B chains of insulin on 
cell bound fluorescence. Relative fluorescence of cells is 
plotted on the abscissa versus relative cell number on the 
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ordinate after incubation of cells (1) without fluorescent 
peptide and with fluorescent peptide at 4 55nM (2) in the 
presence of 3 1 . 6|liM oxidised B chain of insulin, (3) in the 
presence of 22.2|iM oxidised A chain of insulin, (4) in the 
5 absence of non-fluorescent peptide derivatives and (5) in 
the presence of 4.89|liM Acl49-197. 

Figure 17 shows flow cytometry analysis of 
binding of f luoresceinyl-VTRQRRARNGASRS to HEp-2 cells and 
the effect of Acl49-197 on cell bound fluorescence. 

10 Relative fluorescence of cells is plotted on the abscissa 
versus relative cell number on the ordinate after 
incubation of cells (1) without fluorescent peptide and 
with fluorescent peptide at 8jj.M (2) in the absence of non- 
fluorescent peptide derivatives and (3) in the presence of 

15 4 .8 9J1M Acl49-197. 

Figure 18 shows flow cytometry analysis of 
binding of fluoresceinyl -14 9-197 to HEp-2 cells and the 
effect of non-f luore scent truncated peptide derivatives of 
the 149-197 sequence on cell bound fluorescence. Relative 

20 fluorescence of cells is plotted on the abscissa versus 
relative cell number on the ordinate after incubation of 
cells (1) without fluorescent peptide and with fluorescent 
peptide at 455nM (2) in the absence of non-fluorescent 
peptide derivatives and in the presence of (3) 12.2^M 

25 Acl49-197, (4) 48.87^iM Acl63-197, (5) 41.26|iM Acl49-190 and 
(6) 56.1nM Acl49-177. 

Figure 19 shows flow cytometry analysis of 
binding of f luoresceinyl-14 9-197 to HEp-2 cells and the 
effect of benzoylbenzyl-14 9-197 on cell bound fluorescence . 

30 Relative fluorescence of cells is plotted on the abscissa 
versus relative cell number on the ordinate after 
incubation of cells (1) without fluorescent peptide and 
with fluorescent peptide at 468nM (2) in the absence of 
non-f luore scent peptide derivatives and ( 3 ) in the presence 

35 of 4.1|iM benzoylbenzyl-149-197. 
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Figure 20 shows confocal scanning microscopy of 
HEp-2 cells incubated with 4 . 68}_tM fluoresceinyl-14 9-197 
without other peptide derivatives. 

Figure 21 shows confocal scanning microscopy of 
5 HEp-2 cells incubated with 4 . 68fj,M fluoresceinyl-14 9-197 in 
the presence of 3 . IjliM Acl49-197. 

Figure 22 shows confocal scanning microscopy of 
HEp-2 cells incubated with 4 . 68fJ.M f luoresceinyl-149-197 in 
the presence of 22.4|aM Acl49-177. 

10 

DETAILED DESCRIPTION OF THE INVENTION 

Inspection of the deduced amino acid sequence of 
the G protein of many RSV isolates (Satake et al, 1985; 
Wertz et al, 1985; Johnson et al, 1987; Cane et al, 1991; 

15 Sullender et al, 1990; Sullender et al, 1991; Garcia et al, 
1994) reveals several structural domains, which are 
illustrated in Figure 1. The N-terminal region 
(residues 1-38) is located on the inner aspect of the viral 
envelope, and is relatively conserved, as is the 

20 transmembrane region (residues 39-66) . However, the 

ectodomain (232 residues) has two regions of comparative 
variation bordering a central region (residues 149-197), 
which is conserved within subgroups and contains 4 closely 
positioned cysteine residues which are conserved in all RSV 

25 sequences. As shown in Figure 2, this region also has a 
sequence of 13 amino acids, including 2 of the conserved 
cysteine residues, which is identical in all wild type 
isolates of RSV that infect humans. The variable regions 
contain potential sites for N-linked glycosylation of 

30 asparagine, and have an abundance of serine and threonine 
residues which are potential sites for O-linked 
oligosaccharides. Thus the ectodomain comprises 7 
occurrences of the consensus tripeptide sequence 
asparagine-Xaa-threonine/serine, the motif for asparagine- 

35 linked glycosylation, although at three of these sites Xaa 
is proline, which is a contraindication of such 
glycosylation. A relative abundance of proline residues in 
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the regions high in serine and threonine suggests the 
presence of O-linked oligosaccharides. Some functional 
studies indicate an active role for the O- and N-linked 
oligosaccharides of the RSV membrane proteins in cellular 
5 invasion (Lambert, 1988)-, but systematic analyses of the 
structures and positioning of the oligosaccharides remain 
to be performed. 

Experimental Procedures 

10 Peptide Isolation 

Strain A2 RSV G protein was isolated by 
immunoaf f inity chromatography (Walsh, 1984), with 
modifications to include immunoaf f inity columns specific 
for RSV F and nucleocapsid proteins prior to a final 

15 G protein antibody column and elution from the affinity 

column with potassium thiocyanate. Lyophilized G protein 
samples were reconstituted in sufficient 0.1 M NH4HCO3 to 
result in a final concentration of 0.01% Triton X-100 
(v/v) , 0.14 M NaCl and 10 mM phosphate buffer (pH 7.2 in 

20 the absence of NH4HCO3) . N-ethylmaleimide was also added 
to a final concentration of 1 mM. Digestion of the intact 
protein was performed for 4 hours at 37°C using two 
additions of 1% (w/w) of sequencing grade trypsin 
(Boehringer-Mannheim) with the second addition made at 

25 2 hours. Subdigestion of peptide fragments isolated by 

high performance liquid chromatography (HPLC) was achieved 
with pepsin (Boehringer-Mannheim) , thermolysin (Calbiochem) 
and/or post-proline cleavage enzyme (Seikagaku Corporation, 
Tokyo) . Details of individual subdigestion protocols are 

30 described below in association with specific experiments. 

Proteolytic fragments were isolated by reverse 
phase HPLC (RP-HPLC) using slight variations of a 
previously described protocol (Gorman et al, 1990), using a 
2.1 mm x 25 cm column of octadecasilica (Vydac) , a flow 

35 rate of 150 jil/min and a linear gradient from 0.1% (v/v) 
aqueous trif luoroacet ic acid to 80% (v/v) aqueous CH3CN 
containing 0.09% (v/v) trif luoroacetic acid, developed over 
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90 minutes. Gradients were generated using a Hewlett 
Packard chromatography system comprising a 1090M solvent 
delivery system under the control of a DOS Chemstation, and 
elution of peptides was monitored at 214 nm using a 1090 
5 diode array detector. 

Peptide Derivatives 

Peptide derivatives 1-8 (Figure 12) were 

synthesised and purified on a contract basis by Auspep Pty. 
10 Ltd., Parkville, Australia. The peptidyl moiety of 

derivative 9 was synthesised and purified in our 

laboratory. Derivatives 1 and 3 were synthesised by both t- 

Boc/Benzyl and FMoc/t-Butyl solid phase strategies. 

Derivative 2 was synthesised exclusively by the t~ 
15 Boc/Benzyl based solid phase strategy, and other peptides 

were produced by the FMoc/t-Butyl solid phase strategy 

only. 

Amino-terminal derivatisation of peptides 1-8 was 
conducted while the peptides were resin bound, with all 

20 side chain protecting groups intact. Acetylation was 

achieved by acylation with acetic anhydride under basic 
conditions. Benzoylbenzoic acid and biotin were coupled 
using standard coupling reactions. Fluorescent derivatives 
5 and 6 were produced by coupling FMoc-p-alanine to the 

2 5 amino-terminal residue of the nominated sequence and 

subsequently reacting the deprotected amino terminus of the 
p-alanine residues with f luorescein-isothiocyanate under 
basic conditions. Peptide derivatives were cleaved from 
their resin supports under standard acidolytic cleavage 

30 conditions prior to subjecting the cysteinyl residues of 
derivatives 1-3 and 5-8, inclusive, to air oxidation in 
NH4HCO3. Oxidation was monitored by electrospray ionisation 
mass spectrometry, and allowed to proceed until the mass of 
the crude products diminished by 4 Daltons, which indicated 

35 the formation of 2 disulphide bonds from the 4 cysteinyl 

residues of these derivatives. Derivatives 4 and 9 were riot 
subjected to air oxidation, since the cysteinyl residues of 
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5 were maintained in side- chain protected form as 
acetamidomethyl derivatives, and 9 lacked cysteinyl 
residues. Derivative 9 was subjected to amino-terminal 
labelling by direct reaction with fluorescein- 
5 isothiocyanate in an aqueous medium under alkaline 
conditions. 

All peptide derivatives were purified by reverse 
phase high performance liquid chromatography to greater 
than 80% purity, as indicated by integration of peaks in 

10 the chromatogram. In all cases the desired peptide 

derivative was shown to be the predominant product by 
electrospray ionisation mass spectrometry. 

Peptide derivatives were dissolved in deionised 
dististilled water for use in flow cytometry and confocal 

15 scanning microscopy, or dissolved in tissue culture medium 
and sterile filtered for antiviral assays. 

Mass Spectrometry 

Matrix-assisted laser desorpt ion/ionisation time 

20 of flight mass spectrometry (MALDI-TOF-MS ) was performed 
using a Bruker Reflex mass spectrometer (Bruker-Franzen 
Analytik GmbH, Bremen, Germany) . Samples were deposited on 
to target surfaces after mixing with an equal volume of the 
supernatant fraction of a saturated mixture of a-cyano-4- 

25 hydroxy-cinnamic acid (matrix 1) or 3 , 5-dimethoxy-4- 

hydroxy-cinnamic acid (matrix 2) in 33% (v/v) aqueous CH 3 CN 
containing 0.1% (v/v) trif luoroacetic acid (Beavis et al, 
1992) , or a 10 mg/ml solution of a-cyano-4 -hydroxy-cinnamic 
acid in 50% (v/v) C 2 H 5 OH/CH 3 CN (matrix 3), or a 10 mg/ml 

30 solution of 2 , 6-dihydroxyacetophenone in 50% (v/v) 

C2H5OH/CH3CN containing 0.1 M di-ammonium hydrogen citrate 
(matrix 4) (Gorman et al, 1996) . Ionization was achieved 
using a nitrogen laser pulsed at a repetition rate of 3 Hz. 
Laser irradiance was adjusted to threshold levels in order 

35 to observe intact molecular ions, which were accelerated to 
a potential of 28.5 kV on to a hybrid microchannel plate- 
photomultiplier linear detector or subsequently reflected 
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with a reflectron potential of 30 kV on to a dual 
microchannel plate detector. Except where specifically 
noted, molecular ion masses were determined in the 
reflectron mode. Matrix ions were deflected by application 
of a 2 kV potential across deflector plates placed 
immediately after the ion acceleration region in order to 
avoid saturation of the detectors. All measurements were 
made at a digitization rate of 250 Mhz. Masses were 
assigned to intact peptide ions by reference to an external 
calibration file created using the flight times of the 
components of a mixture of 200 fmoles of angiotensin II 
(MH + = 1046.56; monoisotopic mass), 400 fmoles of 
adrenocorticotropic hormone residues 18-39 (MH + = 2466.73; 
average mass) and 2 pmoles of bovine insulin 
(MH + = 5734.54; average mass) applied to a separate target 
spot . 

Analysis of metastable ions arising from post- 
source decay was performed using 25% stepwise decrements in 
the reflectron potential and increasing the laser 
irradiance to optimise production of ions in each voltage 
window (Spengler et al, 1992; Kaufmann et al, 1993; 
Kaufmann et al, 1994) . Masses were assigned to metastable 
ions by reference to a calibration table created by 
determining the behaviour of metastable ions of known mass, 
produced from adrenocorticotropic hormone residues 18-39 , 
at various reflectron potentials (Rouse et al, 1995) . As 
described above, data were acquired at a digitization rate 
of 250 MHz. Assembly of the individual spectra for each 
reflection voltage on to a continuous mass scale was 
performed using Bruker FAST software routines within the 
Bruker XMASS software package. 

Reduction of Disulphide Bonds 

Disulphide containing peptides were reduced, 
after adjusting HPLC fractions to pH 5 by the addition of 
1 M aqueous di-ammonium hydrogen citrate to a final 
concentration of 0.1 M, by addition of 50 mM aqueous 
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tris (2-carboxyethyl) -phosphine (Molecular Probes) to a 
final concentration of 5 mM and incubating the mixtures at 
65°C for 20 minutes. Reduced samples were mixed with an 
equal volume of 2 , 6-dihydroxyacetophenone in 50% (v/v) 
5 C2H5OH/CH3CN and applied to a sample target for mass 
spectrometric analysis . 

Edman Degradation 

Stepwise amino acid sequence analysis of peptides 
10 was performed by automated Edman degradation using a 

Hewlett Packard G1000A solid-phase protein sequenator. 

Cells and Viruses 

Human Laryngeal tumor cells (HEp-2) were grown as 

15 monolayers in Dulbecco ' s modified Eagle's medium ( DMEM ) 

supplemented with 10% (v/v) foetal bovine serum. Cells were 
prepared for flow cytometry, confocal scanning microscopy 
and electron microscopy by washing twice with phosphate- 
buffered saline (PBS) and detached by incubation with ImM 

20 EDTA in PBS for 20 minutes at 37 °C. The cells were 

suspended by agitation and centrifuged at 1000 x g for 5 
minutes, then washed with PBS by centrif ugat ion and 
resuspended in PBS to approximately lxlO 6 cells/ml. Between 
2 and 22jil of solutions of non-fluorescent and/or 

25 fluorescent peptide derivatives were incubated at 0°C for 1 
hr with 2.5 X 10 5 detached cells in 250^1 for these assays. 
These incubation conditions were not detrimental to the 
viability or morphology of the cells. After incubation, 
cells were pelleted by low speed centrif ugation, the 

30 supernatants were decanted and the cells resuspended in 
250|al PBS at 0°C. 

Flow Cytometry 

Propidium iodide was added to the cell suspension 
35 to a concentration of 8pg/ml immediately before the cells 
were analysed by flow cytometry using a Coulter EPICS® 
Elite flow cytometer. The illuminating wavelength was 
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488nm, and forward light scatter, 90° light scatter (side 
scatter) , and fluorescence emission at 525nm (fluorescein) 
and at >600nm (propidium iodide) were recorded. The 
fluorescence of peptide derivatives bound to the live 
5 single cell population was recorded by gating on forward 
and side light scatter, and gating on cells negative for 
propidium iodide fluorescence. 

Con focal Scanning Microscopy 

10 Confocal scanning microscopy was carried out on 

unfixed cells remaining following flow cytometric analysis. 
The cells were mounted in PBS without the addition of anti- 
fade compounds, and observed using a Bio-Rad MRC500 
confocal microscope with illumination from an argon-ion 

15 laser and a 40x0.7 NA objective . The standard fluorescein 
(green) fluorescence filter set was employed. Images were 
accumulated using Kalman filtering. 

Example 1 Disulphide Determination by Analysis of 

20 Proteolytic Fragments 

The whole G protein was digested with trypsin, 
and a fraction of the tryptic digest was then subjected to 
further digestion with pepsin, thermolysin, or post-proline 
cleavage enzyme as described. The digests were analysed by 

25 RP-HPLC, and the results are summarized in Figure 3. 

Chromatogram A resulted from injection of 100 \xl 
of a digest which contained approximately 40 jag of 
G protein exposed to trypsin for 4 hours at 37°C (see 
experimental procedures for detailed conditions) . 

30 Subdigestions of the fraction eluting at 73 minutes in 
chromatogram A were achieved after removal of the CH3CN 
using a stream of high purity nitrogen. Pepsin (10 |il of a 
1 mg/ml solution in 5% (v/v) formic acid) was added to 
100 fil of the fraction and digestion was allowed to proceed 

35 for 2 h at 37°C prior to injecting 100 |al of the digest to 
generate chromatogram B. The fraction was prepared for 
thermolytic digestion by mixing 45 [il of the fraction with 
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45 Hi of 0.1 M NH4HCO3 and 11 jxl of 0.01 M CaCl 2 . 
Thremolysin (10 jil of a 1 mg/ml solution in 0.1 M NH4HCO3) 
was added and digestion allowed to proceed for 2 hours at 
37°C prior to injecting 100 \xl to generate chromatogram C. 
The fraction was prepared for post-proline cleavage enzyme 
digestion by mixing 45 jj.1 of the fraction with 20 \il of 
0.1 M NH4HCO3 before adding 45 (il of 0.1 mg/ml solution of 
the enzyme in the enzyme in 0.1 M ammonium acetate to give 
a final pH of 6.5. Digestion was allowed to proceed for 
2 hours at 37°C prior to injecting 100 \xl of the digest to 
generate chromatogram D. 

The following fragments were produced by 
enzymatic digestion : 



15 



20 



Fragment 1 
Fragments 2 & 3 
Fragment 4 
Fragment 5 

Fragment 6 

Fragment 2R 
Fragment 3R 



Trypsin cleavage of G protein; 
Pepsin cleavage of Fragment 1; 
Thermolysin cleavage of Fragment 1; 
Cleavage of Fragment 1 with post- 
proline protease; 

Cleavage of Fragment 2 with post- 
proline protease; 
Reduced Fragment 2; and 
Reduced Fragment 3. 



25 Cleavage with Trypsin 

RP-HPLC of tryptic digests of the G protein 
consistently revealed only a few discrete peaks of 
absorbance at 214 nm at the column breakthrough and at 
approximately 10.5, 25 and 73 minutes (Figure 3A) . The 

30 remainder of the chromatogram consisted of several series 

of broad baseline rises consisting of poorly resolved peaks 
of absorbance at 214 nm. This appearance is consistent 
with extensive attachment of heterogenous oligosaccharides 
to most of the peptide constituents of the digest. The 

35 peak eluting at 10.5 minutes represented N-ethylmaleimide, 
which was added to the digest as a precaution against 
disulphide bond interchange or disulphide bond oxidation of 
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cysteine-containing fragments. The peak eluting at 
10.5 minutes was not present in a digest from which 
N-ethylmaleimide was omitted. 

The other early-eluting discrete peaks failed to 
5 produce data when examined by MALDI-TOF-MS ; however, as 

shown in Figure 4A, the peak at 73 minutes produced intense 
ion signals at m/z values of 4108.0 and 4125.3. These 
masses are consistent with the tryptic peptide spanning 
residues 152 to 187 of the G protein (Fragment 1), taking 

10 into account oxidation of the four cysteines of this 

sequence to cystine residues, which leads to a loss of 4 Da 
in mass from both peptides. Partial conversion of the 
N-terminal glutamine residue 152 to pyroglutamic acid 
apparently accounts for the difference of 17 Da in mass 

15 between these ions. These data also indicate that 

Fragment 1 does not have any carbohydrate attached to the 
asparagine residue at position 179, the threonine residue 
at position 181, or the serine residues at positions 157, 
174 and 177. 

20 

Subcleavage with Pepsin 

Cleavage of isolated Fragment 1 with pepsin and 
analysis of the unf ractionated digest of MALDI-TOF-MS 
produced ions consistent with residues 152 to 164 and 

25 residues 169 to 187 (Figure 4B) . Ions at m/z = 1653.2 and 
m/z = 1670.0, representing residues 152 to 165, were spaced 
by 17 Da, indicating a mixture of glutaminyl and 
pyroglutamyl residues derived from the N-terminus of the 
original tryptic Fragment 1. Ions at 2096.7 and 2115.7 

30 represented the C-terminal portion of Fragment 1 

(residues 169 to 187), allowing for peptide bond cleavage 
within a cystine loop of the heavier peptide (Fragments 2 
and 3) . As for the parent tryptic peptide, 4 Da had to be 
subtracted from the theoretical masses of the C-terminal 

35 portions to obtain correspondence with the experimentally 
observed masses. This provided further evidence that the 
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four cysteine residues of the sequence were in disulphide 
linkage . 

Fragments 2 and 3 were isolated by RP-HPLC 
(Figure 3B) and analysed as isolated fragments which 

5 provide corroboration of the identities assigned to ions in 
the unf ractionated digest, especially the spacing of 18 Da 
of ions representing the C-terminus of tryptic Fragment 1, 
which indicated intra-disulphide loop peptide bond cleavage 
for the heavier ion. Peaks at 52 and 52.5 minutes 

0 corresponded to N-terminal peptic fragments, the peak of 
61 minutes corresponded to the heavier of the C-terminal 
peptic fragments (Fragment 2; Figure 4C) , and the peak at 
64 minutes corresponded to the lighter of the C-terminal 
fragments (Fragment 3; Figure 4D) . Mass analysis of these 

5 peptides after chemical reduction showed an increase in 
mass of 4 Da for the lighter of the C-terminal fragments, 
consistent with acquisition of four protons as a 
consequence of reduction of two disulphide bonds 
(Figure 6A) . The heavier of the unreduced peptides 

0 actually lost mass, corresponding to loss of the sequence 
Ala-Ile-Cys-Lys and gain of 3 protons following reduction 
of two disulphide bonds (Figure 6B) . These data indicate 
that the peptic cleavage within a disulphide loop to 
produce the heavier of the C-terminal peptic fragments 

5 (Fragment 2) occurred after the sole tryptophan of the 
parent tryptic Fragment 1 (tryptophan 183 of the 
G protein) . 



Subcleavage with Thermolysin 

Cleavage of the tryptic Fragment 1 with 
thermolysin and direct analysis of the digest reaction by 
MALDI-TOF-MS produced ions consistent with cleavages prior 
to phenylalanine residue 19 of Fragment 1 and prior to 
isoleucines at positions 24 and 34 of Fragment 1. Ions 
diagnostic of these cleavages were at m/z values of 912.9 
and 1106.9 (Figure 7A) . The ion at m/z - 912.9 is 
consistent with disulphide bridging between 173 and 186 
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(residues 19-23 of Fragment 1 linked to residues 34-36) , 
and the ion at m/z = 1106.9 is consistent with disulphide 
bridging between cysteines 176 and 182, but without 
cleavage of intra-cystine-loop peptide bonds. RP-HPLC of 
5 the thermolytic digest revealed a complex series of peaks, 
as shown in Figure 3C. Most of these peaks appeared to 
have been derived from the enzyme preparation, as they were 
present in a chromatogram produced using a mock digest 
which lacked Fragment 1. Some of these peaks produced ions 

10 which did not correspond to any portion of Fragment 1. The 
digest had one peak at approximately 38.5 minutes which 
yielded an ion at m/z = 912.2, consistent with the 
cystene 173 to 186 loop. However, this fraction was a 
mixture of peptides, as evidenced by other ions at higher 

15 m/z values, and was not characterized any further. The 
fraction eluting at approximately 54 minutes was not 
present in the mock digest, and yielded the correct mass 
for the peptide containing the cysteine 176 to 182 loop 
(Fragment 4), as shown in Figure 7B. Automated Edman 

20 degradation sequencing produced an N-terminal sequence of 
Ile-Xaa-Ser-Asn, which is consistent with the 
identification of this fragment by mass analysis, with Xaa 
representing a gap corresponding to retention of the PTH 
derivative of one half cystine residue on the sequencer 

25 cartridge due to disulphide linkage to its half cystine 
pair . 

Subcleavage with Post-Proline Cleavage Enzyme 

Cleavage of tryptic Fragment 1 with a post- 
30 proline cleavage enzyme produced a fragment (Fragment 5) 
containing the two disulphides, but without intra- 
disulphide loop cleavage at the proline residue in the 
cystine 176 to 182 loop. Ions with m/z values indicative 
of this fragment (m/z = 1639.7) were observed in both the 
35 digestion reaction mixture (Figure 8A) and in a fraction at 
approximately 63 minutes generated by HPLC of the reaction 
mixture (Figure 3D). This is shown in Figure 8B. Post- 
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proline cleavage of the heavier of the C-terminal peptic 
fragments of Fragment 1 (ie. Fragment 2) also achieved 
cleavage at the extra-cyst ine loop proline but not the 
intra-loop proline (Fragment 6), as indicated by appearance 
5 of an ion at m/z = 1658.8 in the digest illustrated in 
Figure SC. We attempted to obtain further fragments 
consistent with the disulphide bond pattern indicated 
above, using cleavage strategies involving chymotrypsin, 
proteinase K and mild acid; however, these strategies 

10 either failed to contribute additional data or were not 
rewarding at all. 

The residues involved in each fragment, and their 
observed and theoretical m/z ratios, are summarized in 
Table 1, and the structure of each fragment, indicating its 

15 disulphide bonds, is shown in Figure 5. It can be seen 

that the pattern of disulphide bonding is the same in each 
fragment, and that this pattern is destroyed by reduction. 
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Example 2 Disulphide Determination by Mass 

Spectrometric Based Sequence Analysis of 
Fragments 2, 3 and 5 
Analysis of products of gas-phase metastable 
5 decomposition of ions of peptic Fragment 2 by MALDI-TOF-MS 
produced a series of fragment ions illustrated in 
Figure 9A, which were consistent with the disulphide 
bonding pattern deduced by analysis of proteolytic 
fragments (Figure 5) . The series of fragment ions obtained 

10 is represented both diagrammatically and in tabular form in 
Figure 10. These fragment ions are of the b- and y-type 
(Roepstorff and Fohlman, 1984) representing cleavages at 
the peptide bonds along the peptide backbone. Fragment 
ions 1 to 4 inclusive are a series of b-type ions which are 

15 independent of possible disulphide bonding arrangements, 
but fragment ions 5 to 7 inclusive are diagnostic of the 
indicated disulphide linkage, due to the concomitant loss 
of mass of the A I C K sequence together with 
fragmentations at Cysl73 or Serl74 or Ilel75 of the larger 

20 peptide chain. Furthermore, this diagnosis is supported by 
the occurrence of fragment ions 8 to 11, which also bear 
the mass of the A I C K sequence, and by the failure to 
observe this mass accompanying fragment ions 12 to 14 . 

A comparable analysis of peptic Fragment 3 

25 revealed the sequence specific b-type fragment ions 2, 3 

and 4 seen for peptic Fragment 2, which are independent of 
the disulphide bonding pattern which are shown in 
Figure 9B. Fragment ions of Fragment 3 at m/z values of 
1983.3, 1836.7 and 1735.6 are potentially equivalent to 

30 y-type ions 8, 9 and 10 respectively of Fragment 2 when a 
mass difference due to inclusion of an additional peptide 
bond in Fragment 3 is taken into account. Other prominent 
fragment ions of Fragment 3 were apparent at m/z values of 
288.4, 648.9 and 719.1. Only the ions at m/z values of 

35 648.9 and 719.1 were also seen with Fragment 2. The ion at 
m/z = 648.9 can be rationalised as a b-type ion resulting 
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from cleavage between Serl74 and Ilel75 of the peptide, 
plus cleavage of the disulphide bond involving Cysl73. The 
ions at m/z values of 288.4 and 719.1 cannot be easily 
accounted for. None of the fragment ions of Fragment 2 
5 which were used to support the disulphide bonding 

arrangement were observed with Fragment 3. This supports 
the logic used in interpretation of the ion series of 
Fragment 2 used to define the disulphide pattern. This 
logic was dependent upon identifying ions as being 

10 combinations of metastable ion masses due to cleavage along 
the peptide backbone at the N-terminus of the Fragment 2 
plus mass contributed by the disulphide-linked 
A I C K sequence. The A I C K sequence was also linked by 
a peptide bond to Trpl83 in Fragment 3, which explains why 

15 it was not liberated together with metastable fragment ions 
from this peptic fragment. 

As shown in Figure 9C, post-proline cleavage 
Fragment 5 failed to produce metastable fragment ions of 
comparable intensity to those produced by Fragments 2 and 

20 3. These observations with Fragment 5 support the 

conclusions drawn from the data on Fragments 2 and 3. 
Fragment 5 does not have the N-terminal N F V P sequence, 
or the cleavage following Trpl83 required to produce 
fragment ions equivalent to fragment ions 1 to 4 and 8 to 

2 5 11 seen with Fragments 2 and 3. 

As shown in Figure 10 and Table 2, the results of 
analysis of metastable ions produced by peptic Fragments 2 
and 3 and post-proline cleavage Fragment 5 (Figure 9) were 
consistent with the pattern deduced by analysis of 

30 proteolytic fragments as shown in Figure 5. 

Values in Table 2 for the observed fragment ions 
are an average of three separate determinations with peptic 
Fragment 2. Fragment ions 5-7 (b-type ions), inclusive, 
and 8-11 (y-type ions) , inclusive, are diagnostic of the 

35 proposed disulphide pattern, because they account for the 
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mass of the linked peptide, AICK, in addition to the mass 
produced by the cleavage of amino acids as indicated. 



WO 97/46581 PCT/AU97/00351 

32 



CM 
0) 























CD 


















4-> 






CO 


t— t 


CTi 


i — 1 






O 




CM 


1 — 1 


o 


CM 


CM 


LO 




-H 














O 




TJ 


i — i 




LO 


CO 


ro 


VX> 


• 




(D 


o 


lD 


lO 




CM 


CO 


CO 




*4 


o 


CO 


r 




T— 1 


o 


CM 






0\) 


1 — 1 


i — i 


t — 1 


1 — 1 


T 1 


CT\ 


co 


















-p 




GO 


1— 1 






co 


VD 




c 
















V£> 






O 


1 — I 


<-H 


H 


o 


O 




e 
















O 




T3 


+r 


+1 


4-1 


4-1 


4-1 


4-1 




fO 
















+1 




> 


CM 


ro 


CM 


CPt 


r-4 








U 














r-4 




<D 


<—{ 


*T 


lO 


CO 


CO 






i — 1 


CO 


o 


LO 


LO 


LO 


CM 


CO 


CO 




Q 


Q 


CO 


[ — . 


VD 


% 1 


^5 


v m 


c 




CM 


r— 1 


| 


, — | 


, 1 


, 1 


Cn 






































*4 


















(1) 


















E-* 


















1 








o 


t — 1 


CM 


CO 




O 




CO 


cr» 


t — I 


r- ( 


r-H 


t-4 


i-4 




X) 




































-i— * 












CO 


Cm 




r \ 
\J 


CD 


^ 




CM 


L-0 


co 






-«H 


' ' 


CM 


^* 


LO 


CM 








T\ 
\J 














* ¥ j 




n \ 
U> 


LI 7 


CM 


r-4 


CO 


CO 


CO 


Cm 




ti 
( 


III 






i-O 




o 


r-4 




p ( 




CnJ 


co 


^J' 




I — 1 


r-4 


CO 














CM 










CM 


CO 


CM 


LO 






c 














O 


O 








O 


o 


O 


o 






■e 


X5 












+1 


4-1 


Cn 


CD 




+1 


4-1 


4-i 


4-1 






CO 


> 














LO 


M 


*4 








vD 


ON 






Du 


<D 












o 


CO 




CO 




CM 


t— t 


CO 


CM 


CO 


en 


•H 


X) 








LO 


Oi 


o 


r-4 


m 


O 


1 


CM 


CO 




Cm 


r-4 


i-4 


c 


















-H 


















6 




































CD 


















Eh 
1 


















IS 




r-H 


CM 


CO 




lO 




r- 



WO 97/46581 PCT/AU97/00351 

33 

Example 3 Synthesis of Disulphide-Bonded Peptide 

A fully synthetic peptide of sequence 

corresponding to that of Peptide 1 in Figure 2 (SEQ ID 

NO: 1), ie. amino acids 149 to 197 inclusive of the 
5 G protein of RSV strain A2, was prepared using conventional 

solid-phase methods. 

This peptide preferentially formed the same 

disulphide bonding pattern as that identified in the 

previous examples . 

10 

Example 4 Binding of RSV G Protein Residues 149-197 to 

RSV Susceptible Cells 
A series of peptide derivatives was produced 
based on the amino acid sequence and disulphide bond 

15 configuration of residues 149-197 of human RSV A2 strain G 
protein. These peptide derivatives also have features 
common to all strains of human RSV, bovine RSV and ovine 
RSV. These are shown in Figure 12. 

Binding experiments were conducted using RSV- 

20 susceptible HEp-2 cells and f luoresceinyl peptide 

derivatives of the 149-197 sequence, in the presence and 
absence of non-fluorescent peptide derivatives with acetyl 
or benzoylybenzyl substituents on their amino termini. 
Binding of f luoresceinyl peptide derivatives was assessed 

2 5 using flow cytometry and confocal scanning microscopy. 

Peptides corresponding to residues 149-197 and 
163-197 of the A2 strain of human RSV, extended at their 
amino termini by adding (J-alanine followed by reaction with 
fluorescein isothiocyanate, were assesed for their capacity 

30 to bind to uninfected HEp-2 cells using flow cytometry. 

Non-adherent HEp-2 cells exhibited a pronounced increase in 
fluorescence after incubation with f luoreceinyl-149-197 and 
f luoresceinyl-163-197 followed by removal of the 
supernatant by centrif ugation . The degree of fluorescence 

35 increase was dependent upon f luoresceinyl-14 9-1 97 peptide 
concentration over the range of 4 68nM to 4 6.8|iM, but 
saturation was not observed in this concentration range, as 
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shown in Figure 13. Cells incubated with at 890nM and 
8.9|aM f luoresceinyl-163-197 showed a comparable degree of 
fluorescence increase as with the fluorescent derivative of 
149-197 (Figure 14) . 

Non- fluorescent residues Acl4 9-197 ( peptide 
derivative 1) failed to prevent binding of f luoresceinyl- 
14 9-197 { Figure 15) and f luoresceinyl-163-197 (Figure 14 ) . 
On the contrary, the binding of the fluorescent peptide 
derivatives, over the concentration range of 468nM to 
8.9|iM, appeared to be enhanced in the presence of 4 . 9}iM 
Acl49-197 (Figures 14 and 15) . Enhancement of fluorescence 
of bound f luoresceinyl-149-197 was not caused by control - 
peptides such as the isolated oxidised A (22.2^M) or B 
(31.6jaM) chains of insulin (Figure 16) . Unrelated 
f luorescein-labelled peptide derivative 9 produced a modest 
increase in fluorescence of HEp-2 cells (Figure 17) at 8^iM; 
however, the intensity of fluorescence was at least an 
order of magnitude less than with comparable 
concentrations of f luoresceinyl-149-197 or f luoresceinyl- 
0 163-197. The intensity of bound unrelated fluorescent 

peptide was not substantially increased by the inclusion of 
4.89nM Acl49-197 (Figure 17). 

Peptide derivatives synthesised with truncations 
to the N- and C-termini of the 14 9-197 sequence and 
5 derivatised by acetylation of their N-termini (Figure 12) 
also enhanced fluorescence of f luoresceinyl-163-197 
(Figure 14) and fluoresceinyl-14 9-197 (Figure 18) bound to 
HEp-2 cells when added together with the fluorescent 
derivative. While there were variations in absolute 
0 fluorescence values of experiments performed on different 
days, the relative enhancing effects of the peptide 
derivatives was consistently in the order 4>2>3>1. In 
addition to causing the most pronounced enhancement of 
fluorescence, the acetamidomethyl peptide derivative 5 
5 (Figure 12) also caused aggregation of cells (Figures 14 
and 18), and membrane permeability not observed with other 
peptide derivatives . Derivative 7, with benzoylbenzoic acid 
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on the N-terminus of residues 149-197, was also found to 
enhance the fluorescence of HEp-2 cell bound f luoresceinyl- 
149-197 (Figure 19) . 

5 Example 5 Binding and Cappi ng of Peptides 

Binding of f luoresceinyl-149-197 to HEp-2 cells 
was also evident by confocal scanning microscopy in the 
form of peripheral fluorescence on the plasma membrane of 
the cells. In the absence of non-fluorescent peptide 
10 derivatives this fluorescence was in the form of 

patches (Figure 20). However, the fluorescence was 
distributed into larger assemblies when Acl49-197 was added 
to the cells with f luoresceinyl-149-197 under conditions 
that enhance HEp-2 cell fluorescence in flow cytometry 
15 assays. These larger assemblies appeared to form caps with 
a polar distribution on the surface of the cells, and a 
degree of cell aggregation was evident (Figure 21) . 
Confocal microscopy was not conducted with addition of all 
fluorescent derivatives, but the dramatic assembling effect 
20 was also evident with Acl49-177 (Figure 22), which caused 
the most pronounced fluorescence enhancement detectable by 
flow cytometry. The aggregation of cells by Acl49-177 seen 
by flow cytometry was also evident by confocal microscopy 
(Figure 22) . 

25 The enhancement of fluorescence of HEp-2 cell 

bound fluoresceinyl-149-197 and f luoresceinyl-163-197 by 
non-fluorescent derivatives containing various portions of 
the 149-197 sequence is evidence of a complex binding 
interaction between this region of the RSV G protein and 

30 site(s) on cellular receptor (s). This may indicate an 

interaction involving cooperative formation of productive 
ligand interaction (s) or low affinity binding that is 
shifted toward the bound state by the increase in ligand 
concentration caused by inclusion of the non-fluorescent 

35 peptide derivatives. The transition of the fluorescence 
distribution of HEp-2 cell bound derivatives of 149-197 
from a patch appearance to a more dense cap appearance in 
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the presence of non-fluorescent derivatives containing 
various portions of the 14 9-197 sequence is further 
evidence of specific ligand-receptor sites on the 149-197 
region of the RSV G protein. 
5 The demonstration that Acl4 9-177 exhibited a 

profound enhancement of HEp-2 cell bound fluorescence, 
capping of cell bound fluorescence distribution shows that 
a binding interaction site is located in the region of the 
RSV G protein between residues 149-177. This surprising 

10 finding also indicates that the influence of the binding 
site of Acl49-177 does not depend upon disulphide bonds, 
since the two cysteinyl thiols of this peptide derivative 
were protected by the acetamidomethyl group. However, a 
binding interaction may also occur between cellular 

15 receptor (s) and the 149-197 region of the RSV G protein via 
another complementary site that depends upon the disulphide 
bond configuration defined above for the G protein in 

Benzoylbenzyl-149-197 enhanced the fluorescence 
of HEp-2 cell bound f luoresceinyl-149-197 in a similar 

20 manner to Acl49-197, indicating that the benzoylbenzyl 

derivative interacted with cellular receptor (s) and that 
this derivative is useful for photocrosslinking studies 
aimed at identifying the cellular receptors for the RSV G 
protein . 

25 

Example 6 Antiviral Activities of Synthetic Peptide 

Derivatives 

The ability of peptide derivatives to inhibit the 
cytopathic effect (cpe) of RSV on HEp-2 cells was used to 

30 assess whether the peptide derivatives bind to cellular 
receptors for RSV in a biologically relevant manner. 

The impact of peptide derivatives on the 
cytopathic effect (cpe) of the A2 strain of human RSV on 
HEp-2 cells was assessed using monolayers grown in 96-well 

35 plastic tissue culture plates. Serial two-fold dilutions 
of peptide derivatives were added to cell monolayers in 
50|il of sterile tissue culture medium prior to incubation 
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for 1-1.5 hr at 0°C. Virus (50 \xl) was then added and 
incubation continued for 1 hr at 0°C, followed by 4 days at 
37°C without removal of excess peptide derivatives or 
virus. Monolayers were then fixed with formalin and 
5 stained with neutral red. Inhibition of cpe was determined 
by comparison with control cells infected with virus in the 
absence of any pepdtide derivatives. Monolayers were then 
fixed with formalin and viable cells stained with neutral 
red . 

10 The binding of f luoresceinyl-149-197 and 

f luoresceinyl-163-197 to HEp-2 cells, demonstrated by flow 
cytometry and by confocal microscopy in the form of patches 
on the plasma membrane, shows that specific ligand binding 
interaction site(s) for cellular receptor (s) are contained 

15 within this region of the RSV G protein. 

Peptide derivatives 1-4 (Figure 12) inhibited the 
cytopathic effect (cpe) of the A2 strain of human RSV on 
HEp-2 cells to different extents. The IC50 values for 
Acl49-197 and Acl49-190 were approximately 5-10 p.M, which 

20 were comparatively more effective than the other peptide 

derivatives, which had IC50 values of approximately 50 (aM. 
Oxidised A and B chains of insulin failed to inhibit the 
cpe of RSV on HEp-2 cells at 28 an 40 |^M, respectively. 
Oxidised A and B chains of insulin failed to inhibit RSV- 

25 induced cpe of HEp-2 cells when included in these assays at 
28 and 40 \iM, respectively. 



DISCUSSION 

We have now shown that the disulphide bonding 
30 pattern of the ectodomain of the unusual attachment protein 
or G protein of RSV involves a preferred stable 
configuration with Cysl73 linked to Cysl86, and Cysl76 
linked to Cysl82. This was achieved by a combination of 
analysis of proteolytic fragments of the protein and 
35 further analysis of metastable ions produced from the 

proteolytic fragments during MALDI-TOF-MS . These findings 
represent a potent demonstration of the utility of 
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MALDI-TOF-MS for the mass analysis and structural 
elucidation of peptides, with simultaneous characterization 
of post-translational modifications. The observation that 
the synthetic peptide corresponding to residues 14 9 to 197 
5 formed the same disulphide bond arrangement as the viral 

protein strongly indicates that the preferred configuration 
is actually the sole configuration 

It is apparent that the disulphide loop between 
Cysl76 and Cysl82 forms a restricted region of 

10 accessibility, since the proline residue in this loop was 
not vulnerable to post-proline cleavage enzyme under 
conditions where another proline residue preceding the 
disulphide loops was cleaved. In contrast the loop between 
Cysl73 and Cysl86 appears to be more accessible, since 

15 peptide bonds within this loop were cleaved by both pepsin 
and thermolysin. 

Our results are surprising in view of the 
extremely high degree of post-translational glycosylation 
of the RSV G protein. The characteristics of the 

20 chromatogram obtained by HPLC of a tryptic digest of the 
G protein indicated extensive glycosylation of the 
ectodomain. However, it is evident that the region of the 
G protein ectodomain which we have defined does not carry 
any oligosaccharides. Furthermore, the 3 and 10 residues 

25 attached to the N- and C-terminal ends, respectively, of 
this region do not have the potential for glycosylation. 
This represents 49 of the 232 residues of the ectodomain, 
or approximately 20%, which are not glycosylated. 

Thus it is apparent that the ectodomain of the 

30 G protein has a subdomain structure, in which two highly 
glycosylated subdomains of 83 and 101 amino acid residues 
are separated by a comparatively smaller non-glycosylated 
subdomain which has a highly defined disulphide bond 
arrangement. The occupancy status of the remaining 

35 potential glycosylation sites and the characteristics of 
the glycans are yet to be determined. Definition of the 
glycosylation of the subdomains is essential in order to 
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assess the contribution of oligosaccharides to the 
mechanism of action and immunobiology of the G protein 

The region of the G protein ectodomain containing 
the disulphides and the peptide sequences immediately 
5 adjacent to the N- and C-terminal ends of this region 

appear to have functional significance for both receptor 
interactions and immunological reactivity of the G protein. 
A reassessment of earlier antigenic analyses in the light 
of our results indicates an important role for the 

10 disulphides in maintaining the structural integrity of the 
protein. Studies with nested sets of synthetic peptides 
representing overlapping portions of the ectodomain have 
demonstrated that rabbit polyclonal antibodies and murine 
monoclonal antibodies to the G protein and human 

15 convalescent sera from natural infection all react in 
common with a peptide containing three of the four 
cysteines of the ectodomain (Norrby et al, 1987) . The 
rabbit antisera also reacted with a variety of peptides, 
but the convalescent sera only reacted with two other 

20 peptides, one of which overlapped the commonly reactive 

peptide and another closely positioned peptide, while the 
monoclonals only reacted with the commonly reactive 
cysteine containing peptide. It is possible that a wider 
spectrum of antibody reactivities with the G protein might 

25 have been evident had the epitope scanning experiments 
utilised glycosylated domains of the G protein. 

Subsequent studies indicated that the cysteine 
containing region formed a subgroup-specific antigenic 
determinant, and that intact disulphide bonds were 

30 important for this characteristic (Akerlind-Stopner et al, 
1990) . Further support for the immunological importance of 
this region came from studies with escape mutants generated 
using a neutralising monoclonal antibody (Rueda et al, 
1994) . These escape mutants had mutations at either Cysl82 

35 or Cysl86 of their G proteins (Figure 2) . The overall 

effect of these changes was apparently sufficient to enable 
the mutant viruses to escape neutralisation but to retain 
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functionally effective G proteins. Thus, it is possible 
that the mutant viruses had subtle, but immunochemically 
significant, differences in the surface chemistry of the 
cysteine regions of the ectodomains of their G proteins, 
5 while retaining a functionally competent structural fold. 

From our results it is apparent that these mutants retained 
the ability to form one of the two correct disulphides of 
the ectodomain, that is either the Cysl73 to Cysl86 or the 
Cysl76 to Cysl82 linkage. The replacement of the cysteine 

10 residues by arginine residues in both cases presumably 

compensated for the loss of stability associated with loss 
of a disulphide bond by replacement with a residue wit the 
ability to form a salt bridge. 

The functional importance of the disulphide 

15 region is shown by studies using various polypeptides 

produced by recombinant routes or by chemical synthesis to 
immunize experimental animals against challenge with live 
RSV. A recombinant vaccinia virus expressing a polypeptide 
encompassing the disulphide region (residues 1-230 of the 

20 G protein) has been shown to produce neutralising 

antibodies and to confer protection from challenge with 
live RSV, with a response equivalent to that elicited by a 
recombinant vaccinia virus expressing full length 
G protein. In contrast, a recombinant vaccinia virus 

25 expressing a polypeptide terminating at residue 180 

(residues 1-180) failed to provide protection (Olmstead et 
al, 1989) . Another recombinant vaccinia construct 
encompassing residues 124-203, also conferred protection 
(Simard et al, 1995) . The importance the disulphide region 

30 is also illustrated by finding that a synthetic peptide 
containing three of the cysteine residues 
(residues 174-187), which conferred protection from 
challenge by live RSV despite the fact that only non- 
neutralising serum antibodies were produced (Trudel et al, 

35 1991). 

Nested sets of synthetic peptides have also been 
used to attempt to define the portion of the G protein 
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which interacts with the cellular receptor f or ' RSV (Feldman 
and Hendry, 1996) ; however, none of these peptides from the 
ectodomain blocked binding of the G protein to RSV- 
susceptible cells. It was postulated that the peptides may 
5 have lacked secondary or tertiary structural elements 

required for interaction with receptors. However, as with 
the immunological studies, the lack of oligosaccharides on 
these peptides appears to have been overlooked by previous 
workers. Furthermore, the disulphide bond status of these 

10 peptides does not appear to have been addressed. It is 

conceivable that the structural elements missing from the 
synthetic peptides involve correct disulphide bond pairing 
or another element contained by such disulphide bonds. 

Knowledge of the actual disulphide linkage 

15 pattern of G protein ectodomain has facilitated 

experimental assessment of the functional and immunological 
roles of the disulphides and neighbouring peptide 
sequences. Our results have enabled us to develop 
strategies for synthesis of peptides with the correct 

20 disulphide bridging to probe receptor binding and 
immunological interactions of this portion of the 
G protein. 

The antiviral activity of the non-fluorescent 
peptide derivatives containing various regions of the 149- 

25 197 sequence, including Acl49-177 with acetamidomethyl 
protected cysteinyl residues, shows that the binding of 
f luoresceinyl and biotinyl peptide derivatives evident by 
flow cytometry and electron microscopy, and the influences 
of non-fluorescent peptide derivatives on this binding, 

30 represent biologically relevant interaction (s) between the 
RSV G protein and cellular receptor (s) for the virus. 
Furthermore, these antiviral activities indicate that 
mimics of the structures of these peptide derivatives, 
including non-peptide compounds, and/or the actual peptide 

35 derivatives described herein and/or peptide derivatives 
containing various regions of the peptide derivatives 
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described herein may form the basis of therapeutic control 
of RSV infections. 

The demonstration that derivatives of the 
sequence including residues 149-197 of the G protein of RSV 
5 have the appropriate structure to bind to RSV susceptible 
cells in a manner that reflects the biologically relevant 
event of binding of virions to cells shows that these 
derivatives and/or mimics of the structures of these 
derivatives are suitable components of vaccines for 

10 immunotherapeutic control of RSV. 

Furthermore, we have demonstrated that synthetic 
strategies for this subdomain of the G protein did not need 
to consider glycosylation . Peptide by-products with the 
incorrect disulphide bonding arrangements are theoretically 

15 possible from synthetic approaches; however, these can 

easily be identified by routine methods, and will serve as 
control peptides for assessment of the relevance of the 
particular disulphide pattern determined herein. In fact, 
no such peptide by-products were observed. Peptide 

20 products with residual blocking groups were obtained, and 
can also be used as controls. 

It will be apparent to the person skilled in the 
art that while the invention has been described in some 
25 detail for the purposes of clarity and understanding, 

various modifications and alterations to the embodiments 
and methods described herein may be made without departing 
from the scope of the inventive concept disclosed in this 
specification . 

30 
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CLAIMS 

1- A compound having structural homology to a 
contiguous sequence of amino acids within the sequence 
representing residues 149-197 of the G protein of 

5 respiratory syncytial virus, in which 

a) no oligosaccharide is linked to potential 
serine, threonine or asparagine attachment sites; 

b) four cysteine residues are involved in 
disulphide linkages; and 

10 c) the pattern of disulphide linkage is Cys 173 

linked to Cys 186, and Cys 176 linked to Cys 182, 

and in which said compound possesses a biological 
activity of respiratory syncytial virus G protein. 

2 - A compound according to Claim 1 in which the 
15 virus is selected from the group consisting of human RSV 

subtype A, human RSV subtype B, bovine RSV, and mutants and 
variants thereof. 

3. A compound according to Claim 1 or Claim 2 in 

which the compound is a peptide corresponding to amino 
20 acids 158 to 196 of the RSV G protein. 

4 • A compound according to any one of Claims 1 to 3 

in which the peptide corresponds to amino acids 165 to 187 
of the RSV G protein. 

5 - A compound according to any one of Claims 1 to 4 

25 in which the compound is a peptide having one of the 
following amino acid sequences: 



SEQ ID NO 1 KQRQNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICKRIPNKKPGKK 

SEQ ID NO 2 N 

30 SEQ ID NO 3 R 

SEQ ID NO 4 — H 

SEQ ID NO 5 N 

SEQ ID NO 6 N 

SEQ ID NO 7 N 

35 SEQ ID NO 8 R 

SEQ ID NO 9 -S-SKN — K — KD-Y G— QL-KS T — SN K 

SEQ ID NO 10 -S-SKN — K--KD- Y G— QL-KS T — SN--K 
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SEQ 


ID 




1 1 

J. JL 


— P— plfM- — K"— — KTi— Y— — — — — — 


Vj yia— I SN K 


J Hj \/ 


I D 


NO 




_ P — T.PfN K KTi— Y 


— yjj- t\o i — o IN — K — 




t n 


MO 




_ D_ T t<f M T<T Pfn — Y — — — _ 

c J_i I\ LN rv I\ U I — 


Lj UIj J\b l — boN K 


SEQ 


ID 


NO 


14 


-P-LKN — K — KD-Y 


G — QL-KS T — SN — K — 


SEQ 


ID 


NO 


15 


-S-SKN — K — KD-Y 


G--QL-KS T — SN — K-- 


SEQ 


ID 


NO 


16 


NPSGSI — ENHQDHNN-QTLPY- 


T-EG-LA-LSL-HIETERA-SRA 


SEQ 


ID 


NO 


17 






SEQ 


ID 


NO 


18 







10 6. A compound having structural homology to a 

contiguous sequence of amino acids within the sequence 
representing residues 149-197 of the G protein of RSV, in 
which at least one of cysteines 173, 176, 182 and 186 is 
absent or blocked, and in which said compound is not 

15 glycosylated, and has the ability to inhibit infectivity of 
RSV. 

7 . A compound according to any one of Claims 1 to 6 

in which one or more amino acids is replaced by its 
corresponding D-amino acid. 
20 8. A compound according to any one of Claims 1 to 6 

which is a peptidomimetic compound. 

9. A compound according to any one of Claims 1 to 6 

in which one or more individual amino acids is replaced by 
an analogous structure. 
25 10. A diagnostic composition comprising a compound 

according to any one of Claims 1 to 10 together with an 
acceptable carrier. 

11. A pharmaceutical composition comprising a 
compound according to any one of Claims 1 to 10 together 

30 with a pharmaceutically acceptable carrier. 

12. An antibody directed against a compound according 
to any one of Claims 1 to 10. 

13. An antibody according to Claim 12 which is a 
protective antibody . 

35 14. A composition comprising an antibody according to 

Claim 12 or Claim 13. 
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15. A composition according to any one of Claims 10 
to 12 or 13 in which the virus is human RSV. 

16. A method of prevention or treatment of 
Pneumovirus infection comprising the step of administering 
an effective amount of a compound according to any one of 
Claims 1 to 10 to a mammal in need of such treatment. 

17. A method of diagnosis of Pneumovirus infection, 
comprising exposing a biological fluid or sample from a 
mammal suspected' of being infected with said virus to a 
compound according to any one of Claims 1 to 10, and 
measuring the interaction between the compound and said 
fluid or sample. 

18. A method of immunisation against Pneumovirus 
infection, comprising the step of immunising a mammal at 
risk of such infection with an immunising-ef f ective dose of 
according to any one of Claims 1 to 10, said compound being 
immunogenic and having the ability to elicit protective 
antibody . 
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FIGURE 3A 
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FIGURE 4A 
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